AITopics | lqr problem

Abstract-- We derive closed-form extensions of Riccati's recursions (both sequential [4] and parallel [7]) for solving dual-regularized LQR problems. We show how these methods can be used to solve general constrained, non-convex, discrete-time optimal control problems via a regularized interior point method, while guaranteeing that each primal step is a descent direction of an Augmented Barrier-Lagrangian merit function. We provide MIT -licensed implementations of our methods in C++ and JAX. Numerical optimal control, both real-time and offline, has found numerous application domains, ranging from trajectory optimization for robotics (e.g. for autonomous cars, unmanned aerial vehicles, legged robots) and airspace (e.g. Continuous-time optimal control problems, whose optimization variables are functions (thus infinite-dimensional) are typically converted into finite-dimensional optimization problems by either shooting (i.e.

artificial intelligence, optimal control problem, optimization problem, (13 more...)

arXiv.org Artificial Intelligence

2509.1637

Genre: Research Report (0.50)

Industry:

Information Technology > Robotics & Automation (0.54)
Transportation > Passenger (0.34)
Automobiles & Trucks (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.87)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.54)

Add feedback

f4f6dce2f3a0f9dada0c2b5b66452017-Supplemental.pdf

Neural Information Processing SystemsAug-18-2025, 21:46:35 GMT

artificial intelligence, lin, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > New Jersey (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.45)

Add feedback

Stabilizing Dynamical Systems via Policy Gradient Methods Juan C. Perdomo University of California, Berkeley Jack Umenberger MIT Max Simchowitz MIT

Neural Information Processing SystemsAug-18-2025, 21:46:30 GMT

Stabilizing an unknown control system is one of the most fundamental problems in control systems engineering. In this paper, we provide a simple, model-free algorithm for stabilizing fully observed dynamical systems. While model-free methods have become increasingly popular in practice due to their simplicity and flexibility, stabilization via direct policy search has received surprisingly little attention. Our algorithm proceeds by solving a series of discounted LQR problems, where the discount factor is gradually increased. We prove that this method efficiently recovers a stabilizing controller for linear systems, and for smooth, nonlinear systems within a neighborhood of their equilibria. Our approach overcomes a significant limitation of prior work, namely the need for a pre-given stabilizing control policy. We empirically evaluate the effectiveness of our approach on common control benchmarks.

artificial intelligence, controller, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.40)
North America > United States > New Jersey (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

9d94bf4711fa459812437e5df5978551-Supplemental-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 06:52:11 GMT

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.30)

Add feedback

9d94bf4711fa459812437e5df5978551-Paper-Conference.pdf

Neural Information Processing SystemsAug-17-2025, 06:52:07 GMT

algorithm, artificial intelligence, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.32)

Add feedback

Some remarks on gradient dominance and LQR policy optimization

Sontag, Eduardo D.

arXiv.org Artificial IntelligenceJul-17-2025

Solutions of optimization problems, including policy optimization in reinforcement learning, typically rely upon some variant of gradient descent. There has been much recent work in the machine learning, control, and optimization communities applying the Polyak-Łojasiewicz Inequality (PLI) to such problems in order to establish an exponential rate of convergence (a.k.a. ``linear convergence'' in the local-iteration language of numerical analysis) of loss functions to their minima under the gradient flow. Often, as is the case of policy iteration for the continuous-time LQR problem, this rate vanishes for large initial conditions, resulting in a mixed globally linear / locally exponential behavior. This is in sharp contrast with the discrete-time LQR problem, where there is global exponential convergence. That gap between CT and DT behaviors motivates the search for various generalized PLI-like conditions, and this talk will address that topic. Moreover, these generalizations are key to understanding the transient and asymptotic effects of errors in the estimation of the gradient, errors which might arise from adversarial attacks, wrong evaluation by an oracle, early stopping of a simulation, inaccurate and very approximate digital twins, stochastic computations (algorithm ``reproducibility''), or learning by sampling from limited data. We describe an ``input to state stability'' (ISS) analysis of this issue. The second part discusses convergence and PLI-like properties of ``linear feedforward neural networks'' in feedback control. Much of the work described here was done in collaboration with Arthur Castello B. de Oliveira, Leilei Cui, Zhong-Ping Jiang, and Milad Siami.

artificial intelligence, convergence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2507.10452

Country: North America > United States (0.68)

Genre: Research Report (0.50)

Industry: Government > Military (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.34)

Add feedback

Filters

Collaborating Authors

lqr problem

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

f4f6dce2f3a0f9dada0c2b5b66452017-Supplemental.pdf

9d94bf4711fa459812437e5df5978551-Supplemental-Conference.pdf

9d94bf4711fa459812437e5df5978551-Paper-Conference.pdf

Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator

Dual-Regularized Riccati Recursions for Interior-Point Optimal Control

f4f6dce2f3a0f9dada0c2b5b66452017-Supplemental.pdf

Stabilizing Dynamical Systems via Policy Gradient Methods Juan C. Perdomo University of California, Berkeley Jack Umenberger MIT Max Simchowitz MIT

9d94bf4711fa459812437e5df5978551-Supplemental-Conference.pdf

9d94bf4711fa459812437e5df5978551-Paper-Conference.pdf

Some remarks on gradient dominance and LQR policy optimization